DOCtfflEBT BESDHE 



ED 207 ,022 

AOTBOB :/ 
TITLE 

\ WB DATE 
HO^B 



BDBS PBICE 
DESCRIPTOBS 



CS 006' 270 - 



IDEHTIPIEBS 



Anderson, .Jonathan n - i 
Analysing the Beadability of English and Hon-Bnglish 
Texts in the classroom with Liz. 
Aug B1 s r \ - 

19P«; Paper presented at the Annual Meeting of the 
' Australian Beading Association (Darwin, Australia, 
August 19 81). * \ 

HF01/EC01 Plus Postage. 

Co&tent Analysis; Difficulty Level; English; foreign 
Countries;, *Languages; *HeasurGment Techniques; 
Beadability; Readability Formulas; *£eadiag 
BesearQh; Textbook' Evaluation 
*Lix Beadability Formula; Sweden 



ABSTBACT , . , 

1 "LasbarhetsJjidex" <"Lix*)' is a readability formula 
developed in Sweden that holds promise for assessing text difficulty 
in 'other languages, including English. So far three separate studies 
have been conducted to test Lix^rith French and English texts, vith 
German and English texts^ and with TSreek and English texts* In all 
three' cases High correlations were found between the Lix scores ■ 
across languages. The steps for computing Li^ scores are as follows; 
(1) count the total numbeX of wor4s r the number of long words (mote 
than six letters) , and the number, of sentences in the teaft; (2) 
compute the percentage of long words in the text; (3) compute the 
average words per sentence; apd (4) add' the two values for 2 and 3 
and round to the" nearest whole number. Scores usually range from 20 
("very easy") to 60 ("very 'difficult*) • Although the research basel y 

, *tq$ Lix is preliminary and certain language Variations &eces£l}ttct& 
jestablishing norms for Lix scores across languages, the Lix formula 
appears to be easy to compute (assuring ibtercoder reliability) and 

? useful for examining texts a r t,a variety of levels, from young 
children's materials through secondary level and adult textjS. (BL) ^ 
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Paper presented at.thi Seventh Australian Reading .Association Conference* 



There's an obyioua difference in reading ^difficulty between 

Where the amount'of the annuity derived by the ^taxpayer kurmg a , * 
year of income is pore than, or less tdmn, the amount payable for, 
a whole year, the amount to> be excludea from thA amount so derive^ 
is the amount which bears to the,, amount which, but foe this sub- 
sectipn, woalil be the amount .to be so excluded the same proportion 

* as the. amount so derived bears to the amomjt payable for the whole ^ 

* • ^ ■ • ■ 

and ♦ * 

■ 

I went to Sleep with gum in my mouth and now there's, gum in my hair 
an<J when I got out of bed this morning I tripped on the skateboard 
and by mistake I dropped my sweater in the sink while'the water wa^j 
running and I could tell it wag going to be a terrible, horrxble, ^ 
no" goad, very bad day. 1 think I'll move to Australia. 

4 

No reading , teacher needs the findings of readability research to detect 
that the first sentence is a prime example of gobbledygook - a delightful 
onomatopoeic term coined by Maverick, a U.S. government official, to 

r 

* * * * * + 

describe verbose," pompous, 9 often unintelligible, double-ta^lk* He^says that 
the language, of Washington reminded him of the. old, bearded turkey-gobbler 
from his home in Texas, %d>o was always gobbleciygobblingand strutting with 



ridiculous prompbsity. At the the end of his gobble there was. a stfrt of 
gook". , " ' " 



^It may come as no surprise that the first text is quoted from the + 
Australian income Tax Act. 'But what are the cHaracteristics of the second 
text (from "Alexander and. the Terrible, Horrible, No Good, t Very Bad Day!" 
by Judith Viorst) which make it p resume a&ly more comprehensible th^fl the 1 * 
first? This ( articlf looks at techniques which the classroom 'teacher may % 
use, not to* dispense with professional judgment, but to supplement it. In 4 

* m m * \ * 

particular, a little-known technique is highlighted which may prove especially 

useful in the multicultural i classroom environment with English' and tfith 

• * v * i m 1 

v ■ 

other languages. • . V 



An flvefactionist viewpoint 
> - V 



Heading from the top" down , the theme thfe Darwin conference, refers 

/ A • 

„of course not only to the Noith^cn Territory's geographical location. The 

, phrase has been used to encoiqpass those theories 02? models which are to 

• « 

^be distinguished from others with a bottom up orientation/ A similar 

distinction has 'Seen madej^between inside -out arid outside_in theories 

^ tCainbourne 1973) J \^J°P down, like inside-out accounts of the reading prpcess 

emphasise that 'meaning, is \fc>rou$ht to texts primarily from inside the head 

/{the- top) , ,rather than the other way round. 

* • . 

^ Before' x analysing( / ^eatures of text, it is uri^rtant* to consider the 

place of text, not only within the, reading process a$ suggested "by phrases 

" like top down, but within the total writing-reading process*. The view 

favoured here is an ^nteract^onist one; both' writer and reader interact 

1 with text, and writing and reading always occur within a gives context, 

I # •+ • • 

- each interacting with the other. Diagrannnatically the position may be 

portrayed as in Figure, lc ■ . * * 



WRITER 










<H — 


— » 


READER 






. » 



CONTEXT 



FIGURE 1 



Diagrammatic representation of the^ writing-reading process 



t * 



It is not the place here to Explore the various/interactions in Figure 1' 

** " 9 v L* * 

further* Although the focus below is on specific aspepts ^£ text, there 

* / * * P ' 

would be no text without* a writer, and, similarly, reading (of print) requires 
* - X ^ r 

both reader aj^/l text. Itlis for reasons of convenience ax^d managability 



only that text features aie>dis cussed and analysed as though their Effects 

were independent* • > 4 

• \ i* ' V ( ^ • - 

x t 

* c 

Readability of non-sfiglish texts * * 

* * 

There have been three major summaries of readability research in the, 

past twenty years. In Kla*e*s (1963) revieV appeared the comment that 

i 

"very little readability research and formula development have taken 
place in other countries" (p. 98). Thi^s comment apparently applied not 

v only to countries where English was not the mother tongue but to all 
countries other than the .United States. Consequently, Klare continued, 

* "this has made it ratfier natural for American research "workers to apply 



American formulas to foreign language material" (p. 98). The exceptions 

noted were the work of Tharp with French materials, Spaulding's development 

of two formulae for 'Spanish, and Shiba-s application of cloze procedure to 

/ • % 

Japanese. - 

During the next decade the United Kingdom Reading Association monograph 

(Gilliland* 1972) , entitled Readability , was published. Although this latter 
• - \ 

book described readability research in Britain (notably that of Mugford) ; 
Klare 's comments about *the lack of formula development in other (non-English) 

languages. still applied fox Gilliland did not address the problem of readability 

' / * 
languages other than English. 



\ 



\ languages 

The most recent work oil readability, also from the United Kingdom, is 
Harrison' a (1980) Readability in the Classroom . Again-^ surprisingly, no 



•erJc 



reference 0 i$ made to the problem of gauging the readability of non-Engliah 
j materials., . ' . ' * 

• p I 

If education for a multicultural society is one of the key issues 

for reading teachers in the coming decade (se'e Anderson 1981} , the general 

ss of foreign language readability assessment, as evident * 
^ « * ■ * 

in the research summaries cited, seems a major shortcoming. And yet there 

is a readability formula, developed in Sweden, which holds promise for* 

assessing text difficulty in other languages^ including English. Below its 

use with French, German, Greek and English/texts is reported. 

The Swedish formula 

. A few years before the appearance/ of Gilliland's book, Readability , 
Bjdrns&son (1968) published a book entitled Lasbarhet . £y coincidence he 
selected the same one-word title as/ did Gilliland, for lasbarhet is Swedish 
for readal^iity. Bjornsson's booy appears to be little known'outside - 

/ *> - 

Sweden^ (no doubt because relati\jely few people know Swedish), y^t it deserves 

to be brought to the attention/of classroom teachers more widely. 

* / . - V 



t 



Bjornsson (1968) details the development o£ a new readability formula - 

^ / * • ' \< * ' 

from the selection of 12 features of text known to contribute to reading 

' *'.*/. 

difficulty, through to ^the measurement of each of these Across 18 books in . 

I 

each of the 9 levels of, the Swedish comprehensive school (i.e., 162 books 

* 

/ 

in all), finally to the use of the traditional regression approach to 

/ ' - . * 

determine which of the measured tw*' features best estimated .teachers* (apd, 

f v , " — 1 — 

rather 'interestingly, pupils 1 ) Judged boofc\dif£iculty. 

/ ■ x -\ - • . • x ••• 

Each of the 12 readability' factors was .tested against -such requirements 

/ • ' \ * ' 
as validity, reliability, objectivity, and ease of computation. After miich 

trialling and culling of factors, BjornSson, like other researchers before 

him, found that two' factors 4 a word factor and a sentence factor - could 



-1 



j? 



- / 

be used to predict readability quite accurately.' He called his readability 
index jSsba r he ts index .which, in accordances/with Zipf's I£w, in til^^s * 
shortened to Lix. j i 

? . • 

The word factor in Lix Is the familiar word length variable but it is 

2 * , 

measured differently from most^other Readability formulae. Rather thart ra 

/ _ * * ' 

1 count of syllables, polysllabic worqfsr, or unfamiliar words as judged by 

a word list, word length for Lix is guaged by the percentage of long words 

(i.e., words of more than six letters). This method of gauging word^ length 

was considered to make Lix more .objective and quicker to Compute than other 

r * — 

formulae. The sentence factor ."adopted (sentence length) is the average 

r m 

number of t words per sentence, as in the Flesch, -Spache, Fry and many other 

. - i 

measures pf readability. Finally, word and sentence factors are weighted 

■ * * » * * 

t 

equally, atod this, too, contributes to ease of calculation. Lix is then 



r 



•V 



defipecl as fol-i^wsj 

Lix =*word -length 4; sentence length 

\ * ■ • 

I ■* * * 

where wbrd length * percentage of words of n&re than six . * B 

w letters; / 

? * 
and sentence length ■ average number bf words pet sentence.. 

Nelson (1977), irr'conimentirjg upon the computation, of Lix notess 

Lix is not only simple', to ^calculate, avoiding as it does, 
mathematical formulae, It differs from the English readability 
• measures in. two important ways. Firstly, it bypasses the £tfoBlem 
of whether to count monosyllabic wOrds, polysyllabic words of . 
total syllables by including only words beyond a certain length; 
that it is a measure tyhich ignoresVhe linguistic rules of syllabification 
suggests that it is pdfentially useful across languages. Secondly, 
Lix calculates the percentage of long words from 100*word samples 
while sentence length is, computed from separate 10 sentence samples. 

* 

Trialling Lix with, French and English"- texts 1 




In the first trial of Lix tfo be reported here, a graduate student at 
* ( * * 

Flinders University, in an unpublished report (Nelson 1977), tested Lis 



with French and English texts. This was a small independent project undertaken 

x eric . .... . . • 7- : * . 5 



as part of the Diploma in Education and 10 books 8f fictipn uped in upper 

primary and lower secondary school were selected. The books, however, wetfe 

selected b^cau^e of the availability of a translation in the other language. 

Some pf the books 'were English titles for which there was a French translation 

while. others were French titles for which there was an English translation. 

Samples of text were selected randomly from each book and then parallel 

i - ' ■ 

samples were identified in the translation. 

■ 

As a test of Lix # 'indices were calculated for each'title in both French 
• and English. Bjbrnsson's recommended procedure, of selecting a 2000-word 

sample (made up of 20 100-word sariples) and a separate sample of- 200 sentences 
(20 samples each of ,10 sentences) was followed. In order to compare these^ 
estimates of reading difficulty w^th some other acceptable measure, Flesch' s 

Reading Ease Formica {Flesch 1948) was T also applied over the 2000-word 

P t 

' /Sample from each title, in both languages. Flesch's formula was judged 
• appropriate i?or the levels of difficulty of the texts, at least as far as 
English was concerned. <* 



■j 



The resulting correlations are of interest. Lix indices in French 



correlated 0.87 with Lix indices^in English; while Flesch iridices across, 
languages correlated 0.90. Bpfcn correlation coefficients suggested that 
the factors being measured in French and English by these two formulae were 
very similar. When Lix and Flesch were correlated over the French books, 
the correlation was -0.80 while over the English books it was ,-0.78 
(tl\e negative correlations resulting because difficulty is^ rated high on Lix^ 
but low on Flesch) . When jit 'is remembered that the Eix sample from each book 
was not identical to the Flesch sample (because the former included an 
additional 200 sentences), ihpse correlations are indeed high. If Flesch 
is considered to provide a valid measure of text difficulty, then Lix wSuld 
appear to be measuring text difficulty similarly. 



Further trials with German and English texts < 

*A secqnd trialling of Lix was Undertaken* for another project within 

the Diploma in Education at Flinders University (D'Antuqni 1978) , this 

time with German and English texts . From a population of 204 twentieth * 4 

• « • • 

century German worRs of fiction fin the University Library, 20 were randomly 

selepted; independent word and sentence samples were drawn from each, . 

and then these German samples were translated into English. This procedure 

* - * 

m « * 

of selecting text samples £or analysis was adapted in order to reduce the 
variability between languages, by matching pairs of sample^bs closely as 
possible. JWhen translating, each German sentence "was read and then the 
idea involved was expressed as accurately as possi in English" (p*9)* 

As in the project by Nelson, German Lix scores were correlated with 

English Lix scores. (The Felsch formula was not applied}. The correlation 

for the sample of 20 novels across languages was 0.89.* This high degree 

of correlation is evidence that Lix is consistent across German and English. 

\ * 
The result is very similar to that obtained by Nelson (19771 where the 

correlation of Lix< across Fxfcnch and English was 0*87. * ^ 

Although the correlations reported in the Nelson (1977} and D 4 Antuoni 

»• • . 

(1978} studies across languages are high, it does not follow that the 
f i ' ^ 

magnitude of the Lix scores in the three languages is similar. As might 
have been expected, the mean Lix score for German, was higher than that for 

' < • J ' 

English. This is because of the pattern^ in German for words and morpheme^ 

. ■» 
to coalesce to form longer words, thus increasing the percentage of long 

words* At the same time, sentence length is ponsequently decreased. The 

net result* due to the 'much greater difference in the percentage of fohg 

words between German and English, was "consistently higher scores for German 1 ' 

^Antuoni 1978, p*J.l) t . Similarly, Nel'son (1977) reported higher lax 

scores tot French fiction compared with the English translations. These 




findings suggest th$t, % if Lix is to be readily interpreted ipr languages 
besides Swedish, then norms need to be established for Other languages 

Stely. . * -> 

Analysing Greek and English texts \ * 

» * - 

In a third project, undertaken as papt of the Diploma in Education at 
Flinders University (Alexandrou 1980), Lix was trialled with a set 
children's readers ( City Kidb) . These are printed in Greek, Turkish, Italian, 

m m 

and English, with a no-text edition to cater for other ethnic languages. 

Alexandrou examined all 24 Greek booklets and the 24 English parallel texts 

and then Lix and the Edwards readability formula (Edwards 1977) were applied. 

This latter formula is an Australian adaptation of the Spache (195i) formula, 

designed for reading material in the early grades (1 3) . It is based on 

» 

a list of so-called easy words, the major modifications for Australian 

schools being the change of American to Australian usage .(eg., candy ty> lolly) , 

the inclusion of ccarmon words from current .Australian reading series; and 
» \ 
• t * * * 

the omission of certain less common words (eg., Indian) . Before the Edwards 

* ' * p* 

modified word list could be used with Greek texts, it needed to be translated 

* * * 

into Greek, and this was a useful by-product of the project. 

As ,in the prior trialling with German and French texts* Lix scores in 

& * _ 4 /* v 

Greek wer£ correlated vitH Lix scores in English over all books. Due to 

; * 

the size of the books, the total text in each was analysed. The resulting 
correlation was 0.81. The correlation for the Edwards formula across 
languages was 0.77. These correlations are only marginally lower than 

K T \ 

those obtained between German and English And between French and English 

textS, which may well be explained by the yery much smaller samples 

* 9 ^ 

available -in Alexandrou 1 s study*. ' 



V 9 <~ 

interpreting Lix * o ^ ^ * 



* 



To illustrate how Lfx may, be applied inrfc.he classroom and how Lix' 
* ' % \ * 

scores may be interpreted, we return to the* two texts quoted at the 



beginning of this article: the Tax Act and the Very Bad Day . The .necessary 

* ' *' i 

• calculations are. as follows-: *J t 

1* Count for each text 

(a) the total number of words, , * ^^Sf* 

/ * 

/ (b) the number of long words {i.e.*, words of more than 6 letters), and 

/ ■'''"/• 

t' (c) the nuijiber of sentences. • • \ 

. * ^ 

2. Compute word length (percentage of long words); divide (b) by (a) and 

multiply by 100. 

3. Compute sentence length (average length of sentences in words): divid$ 
(a} by (c). ' ' 



4. Add the two values obtained in (2) and (3) and round to the nearest whple* 
number. \ 

< 

These calculations are shown in T&ble 1. 



r 

\4 



TABLE 1 



\ 



Computation of Lix for two texts 



Texts . Tofcal no. No. long * No. of % long Av. sentence Lix 

words words *• sentences words length ~ • 



Tax Act 76 .11 



Very Bad 



14.5* 76.0 91 



% 



Day 68 11 2 16.2 34.0 , 50 



The calculations above illustrate how easy Lix is to compute. The 

counts are quite objective and thus inter-coder reliability is ensured, 

- 

» ♦ 

In addition to ease of calculation, there are certain attractive features 

as far^as interpretation of Ijix scores are concerned, which generally.. do 
not hold fpr other readability formulae. First, the equal weighting of 
word and sentence factors means that the relative contribution of each of 
these factors is more readily apparent^ , In the two sample texts, for instance 
sentence length clearly contributes more than word length to tofcal'Lix 

"score (84 per cent of Lix for. tile Incctoe Tax text 'and 68 pe*r cent of L^x 

i 

for the Very Bad Day text). Second, the word and sentence factors aj:e 

themselves readily interpret able. Thus, the .word* length of 14-5 ( Income 

% > >* 1 

Tax) means that about onp in seven words is a long word; and t^e word 

,* * " ■ *f i t 

length of 16 ♦ 2 ( Very Bad Day ) mews that every sixth word on average ip a 

• " . ** 
loncj word. (It *aay surprise that tjiis latter text contains the higher' 

I J ' *' • l'* 

percentage of JLdng words/. Just as the word length components of -Lix have 

. f' 

meaning, so do the sentence length components. In the Income Tetx text ,the 

sentence length .factor indicates that the average sentence length is ^6 

I 

words while in the second text it averages 34 words, „ 1 



There are other text factors besides those incorporate^ in Lix that 

* " . " i 

can make for reading difficulty. As well> factors within the reafder and 
purposes, for which the texts are written interact with text feature^ to cause 
a partic^Lar text to be either ^easy ,or difficult reading* Li^, like other - 
readability measures, provides* only an in'exact and incomplete estimate of 
difficulty ♦ * It: yields certain information about texts which, like anf 

• *• ' . - \ /* ; 

other information (e»g., content, form and layout), needs to be evaluated . 
in text selection* „ • 1 / V 

, Bjomsson (1968) provided "the following table for interpreting Lix 

„ • * 

scores, (Table 2). However, it is important to note tbat these nprms are - 



* % ^ * » • „ - ^ t ■ * ***** 

# f or Swedish a^d that, as yet, there «e no similar norms for interpreting 
- 4 Lix scbr&s^in other languages except, as above, /by. comparing one text with 



• 'inothfir: 1 



v • * » ♦ 

i 



* - . ' 5 TABLE 2 * 
Interpreting lix scores. (Swedi.sh texts 



— ^ : ^ 1, 

Text difficulty* 

' . , * V 


-tfr? 

r * 

i 


• : » * 


n ; ? ' — e > \ A 

* * » ■ > v 

* # i . - . » • f ■ • 
■ * ■ ' • 1 / 
"Very jeasy^ " ^ 1 

*" v 1 ' '* C * 


>« • ■ t" 

f r • * » 

• • * 


■ ■ *>r 
•^-^ ;. 

' ■ -20 % ' 




\ • 

* % 


25 

* 




r 


■'' 30 1 


* 




35 ■ 


Mediunj 


• 


40 <r 

* 




• 


45 *' * 


Difficult ' j t 
* 




50 
55 

« 


Very difficult 


. Y '• 


60 


/ 


*• 


4 



I 



/ It is important tb note also that the two texts illustrated above* are 
Ve^ry Short (one and two sentences jressfcectively) and no generalisations 
ought to be ma4e With sample^ of this pi2e* .The three studies reported 



using French, German, Greek 



.and\nglish, texts are also small-scale research \ 
studies. Nevertheless, theje is a consistency about the. finding^ that 
suggests Lix mayibo*traeful across^ languages and that Lix may be used at a 

- * m * * * • 

■ * * * * m 

,f variety of levels from young children 1 s n^terials through to secondary-* 

" # « i— * * * 

.level and adult texts. ' • ' . * , e 

• ' A " ' • ) * • • *• • ' * * 
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